Preprocessing Time Series Data for Classification with Application to CRM
نویسندگان
چکیده
We develop an innovative data preprocessing algorithm for classifying customers using unbalanced time series data. This problem is directly motivated by an application whose aim is to uncover the customers’ churning behavior in the telecommunication industry. We model this problem as a sequential classification problem, and present an effective solution for solving the challenging problem, where the elements in the sequences are of a multi-dimensional nature, the sequences are uneven in length and classes of the data are highly unbalanced. Our solution is to integrate model based clustering and develop an innovative data preprocessing algorithm for the time series data. In this paper, we provide the theory and algorithms for the task, and empirically demonstrate that the method is effective in determining the customer class for CRM applications in the telecommunications industry.
منابع مشابه
Application of Artificial Neural Networks in a Two-step Classification for Acute Lymphocytic Leukemia Diagnosis by Blood Lamella Images
Introduction: This study aimed to present a system based on intelligent models that can enhance the accuracy of diagnostic systems for acute leukemia. The three parts including preprocessing, feature extraction, and classification network are considered as associated series of actions. Therefore, any dysfunction or poor accuracy in each part might lead in general dysfunction of...
متن کاملتعیین سطح شالیزارهای حاشیه زاینده رود در منطقه اصفهان با دادههای رقومی سنجندههای ماهواره IRS
To detect the rice paddis areas in Isfahan region, the IRS-1D data from PAN, LISS III and WiFS time series were used. Geometric, atmospheric, radiometric and topographic corrections were applied to various images from 2003 to 2004. Necessary preprocessing and various analyses as well as time series composite image analyses were applied and field sampling was done for appropriate times in 2003 a...
متن کاملتعیین سطح شالیزارهای حاشیه زاینده رود در منطقه اصفهان با دادههای رقومی سنجندههای ماهواره IRS
To detect the rice paddis areas in Isfahan region, the IRS-1D data from PAN, LISS III and WiFS time series were used. Geometric, atmospheric, radiometric and topographic corrections were applied to various images from 2003 to 2004. Necessary preprocessing and various analyses as well as time series composite image analyses were applied and field sampling was done for appropriate times in 2003 a...
متن کاملEnhancing Learning from Imbalanced Classes via Data Preprocessing: A Data-Driven Application in Metabolomics Data Mining
This paper presents a data mining application in metabolomics. It aims at building an enhanced machine learning classifier that can be used for diagnosing cachexia syndrome and identifying its involved biomarkers. To achieve this goal, a data-driven analysis is carried out using a public dataset consisting of 1H-NMR metabolite profile. This dataset suffers from the problem of imbalanced classes...
متن کاملDiscrimination of Golab apple storage time using acoustic impulse response and LDA and QDA discriminant analysis techniques
ABSTRACT- Firmness is one of the most important quality indicators for apple fruits, which is highly correlated with the storage time. The acoustic impulse response technique is one of the most commonly used nondestructive detection methods for evaluating apple firmness. This paper presents a non-destructive method for classification of Iranian apple (Malus domestica Borkh. cv. Golab) according...
متن کامل